
SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: TUCKER et al . 

(ii) TITLE OF INVENTION: MORAXELLA CATARRHAL I S OUTER MEMBRANE 
PROTEIN-106 POLYPEPTIDE, GENE SEQUENCE AND USES THEREOF 

(iii) NUMBER OF SEQUENCES : 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS LLP 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

- - - | E -j COUNTRY": " USA" " ~ " " 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

C3 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

v3 (D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

in 

(vi) CURRENT APPLICATION DATA: 

yj (A) APPLICATION NUMBER: 08/968,685 

lf§1 (B) FILING DATE: November 12, 1997 

ll (C) CLASSIFICATION: 

^ (viii) ATTORNEY /AGENT INFORMATION: 

^ (A) NAME: Baldwin, Geraldine F. 

P (B) REGISTRATION NUMBER: 31,232 

Ul (C) REFERENCE/DOCKET NUMBER: 7969-060 



ry 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 7 90-9090 

(B) TELEFAX: (212) 869-8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

lie Gly lie Ser Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg 
1 5 10 15 
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Gly Asp Lys Ser He Ala He Gly Asp He Ala Gin Ala Leu Gly Ser 
20 25 

Gin Ser He Ala He Gly Asp Asn Lys He Val 

a n 



35 40 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
. (v) FRAGMENT TYPE :_ internal _ _ _ _ _ _ ______ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Gly Thr Val Leu Gly Gly Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "probe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GGNACNGTNC TNGGNGGNAA RAAR 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 72 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 
GAA GCG GAC GGG GGG AAA GGC GGA GCC AAT GCG CGC GGT GAT AAA TCC 
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ATT GCT ATT GGT GAC ATT GCG CAA 
He Ala He Gly Asp He Ala Gin 
20 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 : 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 
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He Ala He Gly Asp He Ala Gin 



n 20 



f| (2) INFORMATION FOR SEQ ID NO : 6 : 



?* (i) SEQUENCE CHARACTERISTICS: 

W (A ) LENGTH: 24 base pairs 

iiU (g) TYPE: nucleic acid 

H (C) STRANDEDNESS : unknown 

*5 (D) TOPOLOGY: unknown 

h (ii) MOLECULE TYPE: other nucleic acid 

Q (A) DESCRIPTION: /desc = "probe 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
YTTYTTNCCN CCNAGNACNG TNCC 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " "probe"" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GGNACNGTNT TRGGNGGNAA RAAR 
(2) INFORMATION FOR SEQ ID NO : 8 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic add 
(A) DESCRIPTION: /desc = "probe 

(v) FRAGMENT TYPE: N-terminal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
YTTYTTNCCN CCYAANACNG TNCC 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CTATTGACTT AAATCACCAT ATGGTTATAA TTTAGCATAA TGGTAGGCTT TTTGTAAAAA 
TCACATCGCA ATATTGTTTT ACTGTTACTA CCATGCTTGA ATG ACGATC C AAATCACCAG 
ATTCATTCAA GTGATGTGTT TGTATACGCA CCATTTACCC TAATTATTTC AATCAAATGC 
CTATGTCAGC ATGTATCATT TTTTAAGGTA AACCACCATG AATCACATCT ATAAAGTCAT 
CTTTAACAAA GCCACAGGCA CATTTATGGC CGTGGCGGAA TATGCCAAAT CCCACAGCAC 
GGGGGGGGGT AGCTGTGCTA CAGGGCAAGT TGGCAGTGTA CGCACTCTGA GCTTTGCCCG 
TATTGCCGCG CTCGCTGTCC TCGTGATCGG TGCGACGCTC AATGGCAGTG CTTATGCAGG 
TATCGGAATT AGTGAAGCAG ACGGGGGAAA AGGCGGAGCC AATGCGCGCG GTGATAAATC 
CATTGCTATT GGTGATATTG CTCAGGCACT TGGCTCTCAA TCTATTGCTA TCGGTGACAA 
CAAAATAGTT CATAATTCAA ATAATAATGC TAATATAGGT GCCAAAGCCT CAGGTAATGA 
GT CCATCGCC ATCGGTGGTG ATGTATTGGC TTCTGGTCAT GCCTCGATTG CCATCGGTAG 
TGATGACTTA TATTTGAAAA AGGAAACGGT ACAGCAAATC TCAGAGCTTC TACCTATTAT 
TCGCGGACAG AAAGCATTAA ACGATATATA CCAACTAGCT GACACTAATC TTCAAAAATA 
TAGACGCACA CACGCACAGG GACACGCCAG TACTGCAGTG GGAGCCATGT CATATGCAAA 
GGGTCATTTT TCCAACGCCT TTGGTACACG GGCAACAGCT GAAGGTACCT ATTCCTTGGC 
AGTGGGTCTT ACCGCCACAG CCAAAGCAGC ATCTTCAATC GCTGTTGGTT CTAATGCACA 
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AGCTATCGGG TTTGCAGCGA CAGCCGTTGG TGGAAGTACT CAAGTTAATT TGAATCGAGG 
TATTGCCCTA GGTTTTGGTT CTCAGGTCCT TCAGAAGGAT AATGATGTAA ATGCAGCAAA 
TGTACGGGCC TATGCACCAG ATGATAACCA GCCAATAGAC AACCGGTATA AAGCCACCTT 
CAAGAATGGT GCTACGGATG TATTTTCCAT TGGTAATAGT AATGGGAATG ACAGTATCAG 
GCGTAAAATC ATCAATGTCG GTGCAGGTTC TGCGGATACC GATGCGGTCA ATGTGGCACA 
GCTTAAAGAG GCGGTGAGGC TGGCTAATCG TCAAATTACT TTTAAGGGTG ATGATAGCAA 
TAATAGAGTA GAAAAAGGTT TGGGCAAGAC TTTAACTATC ACAGGTGGTG CACAGACCAG 
CGCATTAACC GATCATAACA TCGGTGTGGT ACAAAATGGC GATGGTCTGA AAGTTCAACT 
■TGCTGAAACT .TTAACCAGCC TTAAAATGGT TACCACTGAA AACCTAACCG CCAACGAGAA 
AGTTACCGTA GGCAAAACCC GCCTTACCAC AGATAAAATT GGTTTTACCA ATGATATGAA 
TGGCATTGAT GAAAGCAAAC CTTATCTTGA TAAAGACACT GGCATTCATG CAGGTGGTCA 
AAAGATTACC AAACTTACTG CTGGTGTAGT AGATGACGAT GCGGCAACTT ATGGACAGCT 
f| TAAAAAAGTT AACCAAACCG CTGAAAGTGC TCTACAAACC TTT AC CGTTA AAAAGGTAGA 1740 

TAAAAATGGT AATGATGCTA ATGACAGCAA AATCATCACC GTGGGTAAAA ATAACAAACC 1800 
AGACGGTACT CAAGTCAACA CCCTAAAACT CAAAGGTGAA AACGGTGTTG ATGTTACAAC I860 
CGAAACAAAT GGTACAGTTA CCTTTGGGCT TAACCAAAAT AACGGTCTGA CCGTTGGCAA 1920 
CAGCACCCTA AACAACGATG GCTTATCTGT TAAAAACACC AATAGTAACA AACAAATCCA 1980 
AGTCGGTGCT GATGGCATTA CATTTACTGA TATCAGCAAT AGTAAGCCAG GTGCTGGCAT 2040 
TGAAAATACC ACTCGCATTA CCAGAGACGG TATTGGTTTT GCTAATAATA CTGGTTCATT 2100 
3 GGATGCAAAC AAACCCCGCC TAACCCCAAC TGGCATTAAC GCAGGTGGTA AAGAGCTGAC 2160 

CAATGTCCAA TCTGCCATTA ACCCTGCTAC CAATGGTGGG CAGCTAGACT TTATGAACCG 
CCTAAGCACT GCTAATACCG AAAAATCAGG CTCTGCCGCC ACCATTAAAG ACTTATACAA 
CCTATCACAA GTACCGCTGA CCTTTGCAGG TGATACAGGT CCTAATGTCA CCAAAAAACT 
GGGCGAGATT TTAAAGGTTA AAGGTGGTAA AACCACAGCT GATGATTTAA CCAAAAATAA 
CATCGGTGTG GTGGCTGATA GTACCGATAA TAGCTTAACC GTTAAACTTG CTAAAACTTT 
AAGCGATCTT GATGCGGTTA ATACTAAAAC CCTAACTGCC AGCGATAAAG TTACCGTAGA 
CAGTGGCAAC AACACCGCTA AGCTACAAAA TGGTGATTTA ACCTTTAGCA AACAAAATAC 
AGGTGCTACC CCTGCCACCA ACAGCAAAAC CATCTATGGC GTTGATGGCT TGAAGTTTAC 
TGATAACAAT GGTATAGCAC TTGACGGCAC AACTTACATC ACCAAAGACA AAGTTGGCTT 
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TGCTAAGCAA GATGGTTCAC TTGATAAAAG CAAACCTTAT CTTGATAAGG ACAAGCTAAA 
AGTGGGTGAA GTTGAGATTA CCACCAACGG CATTAATGCA GGTGGTAAAG CCATCACAGG 
ACTAAGCAAT ACCCTAACCG ATGCCACCAA CGCAACAACA GGGCATGTAA CTCAATTGGG 
TATCGTTGAT AGTACTGACA AAACCCGTGC CGCCAGCATT GGTGATGTGC TAAACGCAGG 
CTTTAACCTA AAAAATAATG GTGACGCCAA AGACTTTGTC TCCACTTATG ACACTGTTGA 
TTTTATCAAT GGCAATGCCA CCACCGCTAA AGTCACTTAT GATGGCAAAG CCAGTAAAGT 
GGCGTATGAT GTCAATGTGG ATGGTACAAC CATTCATCTA ACAGGCGCTG ATGGCAATAA 
AAACCAAATT GGCGTAAAAA CCACCACACT GACCAAAACA GATGCTAAAG GTGATAAAGC 
- - AATTAACTTT AGTGTTAACT CTGGTGATGA CAAAGCCCrr ATTAACGCCA AAGACATCGC 
CGACAATCTA AACACCCTAG CTGGTGAAAT TCGCAACACC AAAGGCACAG CAGACACCGC 
CCTACAAACC TTTCAAGTCA AAAAAGTCAA AGAAAATGGT GATGATGATA ATGACGCTGA 
CACCATCACC GTGGGTAAAG ATGCAAAAAC CAATCAAGTC AACACCCTAA AACTCAAAGG 
TAAAAACGGT CTTGATATTC AAACCAATAA AGATGGTACG GTTACCTTTG GCATTAACAC 
CCAAAGCGGT CTTAAAGCCG GCAACAACAC CACTCTAAAC AACAATGGCT TGTCTATTAA 
A^CACCGCT GGTAACGAAC AAATCCAAGT CGGTGCTGAT GGCGTGAAGT TTGCCAAGGT 
TAATAATGGT GTTGTAGGTG CTGGCATTGA TGGCACAACT CGCATTACCA GAGATGAAAT 
TGGCTTTGCT GGGACTAATG GCTCACTTGA TAAAAGCAAA CCCCACCTAA GCAAAGACGG 



iu CATTAACGCA GGTGGTAAAA AGATTACCAA CATTCAATCA GGTGAGATTG CCCAAAACAG 3780 

CAATGATGCT GTGACAGGCG GCAAGATTTA TGATTTAAAA ACCGAACTTG AAAACAAAAT 3840 

CAGCAGTACT GCCAAAACAG CACAAAACTC ATTACACGAA TTCTCAGTAG CAGATGAACA 3900 

AGGTAATAAC TTTACGGTTA GTAACCCTTA CTCCAGTTAT GACACCTCAA AGACCTCTGA 3960 

XGTCATCACC TTTGCAGGTG AAAACGGCAT TACCACCAAG GTAAATAAAG GTGTGGTGCG 4020 

TGTGGGCATT GACCAAACCA AAGGCTTAAC CACGCCTAAG CTGACCGTGG GTAATAATAA 4080 

TGGCAAAGGC ATTGTCATTG ACAGCCAAAA TGGTCAAAAT ACCATCACAG GACTAAGCAA 4140 

CACTCTAGCT AATGTTACCA ATGATAAAGG TAGCGTACGC ACCACAGAAC AGGGCAAGAT 4200 

AATCAAAGAC GAAGACAAAA CCCGTGCCGC CAGCATTGTT GATGTGCTAA GCGCAGGCTT 4260 

TAACTTGCAA GGCAATGGTG AAGCGGTTGA CTTTGTCTCC ACTTATGACA CTGTCAACTT 4320 
TGCCGATGGC AATGCC AC C A CCGCTAAGGT GACCTATGAT GACACAAGCA AAACCAGTAA 4380 
AGTGGTCTAT GATGTCAATG TGGATGATAC AACCATTGAA GTTAAAGATA AAAAACTTGG 4440 
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CGTAAAAACC ACCACATTGA CCAGTACTGG CACAGGTGCT AATAAATTTG CCCTAAGCAA 4500 
TCAAGCTACT GGCGATGCGC TTGTCAAGGC CAGTGATATC GTTGCTCATC TAAACACCTT 4560 
ATCTGGCGAC ATCCAAACTG CCAAAGGGGC AAGCCAAGCG AACAGCTCAG CAGGCTATGT 4620 
GGATGCTGAT GGCAATAAGG TCATCTATGA CAGTACCGAT AACAAGTACT ATCAAGCCAA 4680 
AAATGATGGC ACAGTTGATA AAACCAAAGA AGTTGCCAAA GACAAACTGG TCGCCCAAGC 4740 
CCAAACCCCA GATGGCACAT TGGCTCAAAT GAATGTCAAA TCAGTCATTA ACAAAGAACA 4800 
AGTAAATGAT GCCAATAAAA AGCAAGGCAT CAATGAAGAC AACGCCTTTG TTAAAGGACT 
TGAAAAAGCC GCTTCTGATA ACAAAACCAA AAACGCCGCA GTAACTGTGG GTGATTTAAA 
TGCCGTTGCC CAAACACCGC TGACCTTTGC AGGGGATACA GGCACAACGG CTAAAAAACT 
GGGCGAGACT TTGACCATCA AAGGTGGGCA AACAGACACC AATAAGCTAA CCGATAATAA 
CATCGGTGTG GTAGCAGGTA CTGATGGCTT CACTGTCAAA CTTGCCAAAG ACCTAACCAA 
TCTTAACAGC GTTAATGCAG GTGGTACCAA AATTGATGAC AAAGGCGTGT CTTTTGTAGA 
C =j CTCAAGCGGT CAAGCCAAAG CAAACACCCC TGTGCTAAGT GCCAATGGGC TGGACCTGGG 
5 TGGCAAGGTC ATCAGCAATG TGGGCAAAGG CACAAAAGAC ACCGACGCTG CCAATGTACA 

W ACAGTTAAAC GAAGTACGCA ACTTGTTGGG TCTTGGTAAT GCTGGTAATG ATAACGCTGA 

U CGGCAATCAG GTAAACATTG CCGACATCAA AAAAG AC C C A AATTCAGGTT CATCATCTAA 

f CCGCACTGTC ATCAAAGCAG GCACGGTACT TGGCGGTAAA GGTAATAACG ATACCGAAAA 

S ACTTGCCACT GGTGGTGTAC AAGTGGGCGT GGATAAAGAC GGCAACGCTA ACGGCGATTT 

AAGCAATGTT TGGGTCAAAA CCCAAAAAGA TGGCAGCAAA AAAGCCCTGC TCGCCACTTA 
TAACGCCGCA GGTCAGACCA ACTATTTGAC CAACAACCCC GCAGAAGCCA TTGACAGAAT 
AAATGAACAA GGTATCCGCT TCTTCCATGT CAACGATGGC AATCAAGAGC CTGTGGTACA 
AGGGCGTAAC GGCATTGACT CAAGTGCCTC AGGCAAGCAC TCAGTGGCGA TAGGTTTCCA 
GGCCAAGGCA GATGGTGAAG CCGCCGTTGC CATAGGCAGA CAAACCCAAG CAGGCAACCA 5820 
ATCCATCGCC ATCGGTGATA ACGCACAAGC CACAGGCGAT CAATCCATCG CCATCGGTAC 5880 
AGGCAATGTG GTAGCAGGTA AGCACTCTGG TGCCATCGGC GACCCAAGCA CTGTTAAGGC 5940 
TGATAACAGT TACAGTGTGG GTAATAACAA CCAGTTTACC GATGCCACTC AAACCGATGT 6000 
CTTTGGTGTG GGCAATAACA TCACCGTGAC CGAAAGTAAC TCGGTTGCCT TAGGTTCAAA 6060 
CTCTGCCATC AGTGCAGGCA CACACGCAGG CACACAAGCC AAAAAATCTG ACGGCACAGC 6120 
AGGTACAACC ACCACAGCAG GTGCAACCGG TACGGTTAAA GGCTTTGCTG GACAAACGGC 6180 
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GGTTGGTGCG GTCTCCGTGG GTGCCTCAGG TGCTGAACGC CGTATCCAAA ATGTGGCAGC 
AGGTGAGGTC AGTGCCACCA GCACCGATGC GGTCAATGGT AGCCAGTTGT ACAAAGCCAC 
CCAAGGCATT GCCAACGCAA CCAATGAGCT TGACCATCGT ATCCACCAAA ACGAAAATAA 
AGCCAATGCA GGGATTTCAT CAGCGATGGC GATGGCGTCC ATGCCACAAG CCTACATTCC 
TGGCAGATCC ATGGTTACCG GGGGTATTGC CACCCACAAC GGTCAAGGTG CGGTGGCAGT 
GGGACTGTCG AAGCTGTCGG ATAATGGTCA ATGGGTATTT AAAATCAATG GTTCAGCCGA 
TACCCAAGGC CATGTAGGGG CGGCAGTTGG TGCAGGTTTT CACTTTTAAG CCATAAATCG 
CAAGATTTTA CTTAAAAATC AATCTCACCA TAGTTGTATA AAACAGCATC AGCATCAGTC 
ATATTACTGA TGCTGATGTT TTTTATCACT TAAACCATTT TACCGCTCAA GTGATTATCT 
TTCACCATGA CCAAATCGCC ATTGATCATA GGTAAACTTA TTGAGTAAAT TTTATCAATG 
TAGTTGTTAG ATATGGTTAA AATTGTGC C A TTGAC CAAAA AATTACCGAT TTATCCCGAA 
AATTTCTGAT TATGATCACT TTTCATAAAT TTCCCCAATT TGTCTTTATA AATATCCCAA 
GAAATGGTAT TATTTTATTG CCATCAGCAT ATGCGACAAC TCATCGTATC ATCTTTTTAT 
CATAAAAATG CAAATAGGCA TATGCATTTT TTGAATTGAA CTTACGCACT GAGAGATCCC 
CTCATAATTT CCCCAAAGCG TAACCATGTG TGAATAAATT TTGAGCTAGT AGGGTTGCAG 
CCACGAGTAA GTCTTCCCTT GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA 
AGCGAACTGT TGAGAGTACG TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC 
AACCTTGTTT CTGAGCATGA ACGCCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA 
TTAGCAGCAT GATATCAAAA CGCTCTGAGC TGCTCGTTCG GCTATGGCGT AGGCCTAGTC 
CGTAGGCAGG ACTTTTCAAG TCTCGGAAGG TTTCTTCAAT CTGCATTCGC TTCGAATAGA 
TATTAACAAG TTGTTTGGGT GTTCGAATTT CAACAGGTAA GTTAGTTGCT AGAATCCATG 
GCTCCTTTGC CGACGCTGAG TAGATTTTAG GTGACGGGTG GTGACAATGA GTCCGTGTCG 
AGCGCTGATT TTTTCGGCCT TTAGAGCGAG ATTTATACAA TAGAATTTGG CATGAGATTG 
GATTGCTTTT AGTCAGCCTC TTATAGCCTA AAGTCTTTGA GTGACTAGAT GACATATCAT 
GTAAGTTGCT GATAGGTTTC CAGTTTTCCG CTCCTAGGTC TGCATATTGT ACTTTTCCTC 
TTACTCGACT TAACCAGTAC CAACCCAGCT TCTCAACGGA TTT ATAC CAT GGCACTTTAA 
AGCCAGCATC ACTGACAATG AGCGGTGTGG TGTTACTCGG TAGAATGCTC GCAAGGTCGG 
CTAGAAATTG GTCATGAGCT TTCTTTGAAC ATTGCTCTGA AAGCGGGAAC GCTTTCTCAT 
AAAGAGTAAC AGAACGACCG TGTAGTGCGA CTGAAGCTCG CAATACCATA AGCCGTTTTT 
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GCTCACGGAT ATCAGACCAG TCAACAAGTA CAATGGGCAT CGTATTGCCC GAACAGATAA 7980 
AGCTAGCATG CCAACGGTAT ACAGCGAGTC GCTCTTTGTG GAGGTGACGA TTACCTAACA 8040 
ATCGGTCGAT TCGTTTGATG TTATGTTTTG TTCTCGCTTT GGTTGGCAGG TTACGGCCAA 8X00 
GTTCGGTAAG AGTGAGAGTT TTACAGTCAA GTAAGGCGTG GCAAGCCAAC GTTAAGCTGT 8X60 
TGAGTCGTTT TAAGTGTAAT TCGGGGCAGA ATTGGTAAAG AGAGTCGTGT AAAATATCGA 8220 
GTTCGCACAT TTTGTTGTCT GATTATTGAT TTTTGGCGAA ACCATTTGAT CATATGACAA 8280 
GATGTGTATC TACCTTAACT TAATGATTTT GATAAAAATC ATTAGGGGAT TCATCAGACT 
TACGCATCTT TCATTATGGG AATTAGGTCA GTAATTATGA CAAAAAATTA TGCATTATTA 
TCCGTCTCAG ATAAAACGCA AATCGTTGAA TTTGCCCAAG GTTTGGTAGA ATCTGGCTTT 
GGTATTTTAT CCACAGGTGG TACTTTTAAA CTCTTAAAAG AACATGGGAT TGACGCCATT 
GAGGTTTCTG CCCATACAGG TTTTGCTGAA ATGATGGATG GTCGTGTTAA GACCCTACAT 
CCCAAAATTC ATGGTGGTAT TTTGGGCCGT CGTGGCATTG ATGATGCCAT TATGAATGAA 
CATGGCATTG ATCGCATTGA TATCGTTGTC GTGAATTTAT ATCCATTTGC CAACACGGTC 
GCCAAAGACG GTGTTGTTAT GTCTGATGCG ATTGAAAATA TTGATATTGG TGGGCCTGCT 
ATGGTACGCT CAGCCGCCAA AAATCATGCC CATGTTGGTA TTATCACCAG CCCAAATGAC 
TACTCACGCA TCCTAGATGA ACTAAAAAAC CAAGGTCATT TAAGCCACAA CACTCGTTTT 
GATTTGGCAG TCAAAGCATT TGAACACACT GCCGCCTATG ATGGTATGAT TGCCAGCTGG 
CTAGGTGCAC GCTTACCAGT GGATAAAGAG ACGGCACCCA GTGATGATGC CACTGCAACC 
ACTCAATTTT CACGCACTTT TAATCACCAA TTCACCAAAG CACAAGAGCT TAGATATGGC 
G AAAACC C AC ATCAGTCAGC AGCCTTTTAT GTAGATGATC ATGCAACAGA AGCGTCTGTT 
GCGACTGCAC AGCAATTACA AGGTAAAGCG TTGTCTTATA ATAATATTGC TGATACCGAT 9180 
GCGGCACTTG AGTGTGTCAA ATCTTTTACC ACGCCTGCTT GTGTGATTGT CAAACATGCC 9240 
AATCCTTGTG GTGTTGCAAC ATCAGAAAAC GGTATTTTAG ATGCTTATCA CTTAGCATAT 
GCAACCGATC CTGAATCTGC CTTTGGTGGC ATTATTGCCT TTAACCGAGA ATTAGACAGT 
GATACAGCCC GTACCATCGT TGAGCGTCAA TTTGTTGAAG TCATCATCGC ACCAAGCATC 
GCTGAAGGTG TTCTAGAGCG GCCGCGGGCC CATCGATTTT CCACCCGGGT GGGGTACCAG 
GTAAGTGTAC CCAATTCGCC CTATAGTGAG TCGTATTACA ATTCACTGGC CGTCGTTTTA 
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# 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Met Asn His He Tyr Lys Val He Phe Asn Lys Ala Thr Gly Thr Phe 
Met Ala val Ala Glu Tyr Ala Lys Ser His Ser Thr Gly Gly Gly Ser 
Cys Ala Thr lly Gin Val Gly Ser Val Arg Thr Leu Ser Phe Ala Arg 
Il-e -Ala Ala. Leu. Ala Val Leu yll He Gly Ala Thr Leu Asn Gly Ser 
Ala Tyr Ala Gly He Gly 'lie Ser Glu Ala Asp Gly Gly Lys Gly Gly 

Ala Asn Ala Arg Gly Asp Lys Ser He Ala He Gly Asp He Ala Gin 

85 

Ala Leu Gly Ser Gin Ser He Ala He Gly Asp Asn Lys lie Val Hi. 
Asn Ser Asn Asn Asn Ala Asn lie Gly Ala Lys Ala ser Gly Asn Glu 
Ser He III He Gly Gly Asp III Leu Ala Ser Gly His Ala Ser He 
Ala He Gly Ser Asp As P III Tyr Leu Lys Lys Glu Thr Val Gin Gin 
III ser Glu Leu Leu Pro He He Arg Gly Gin Lys Ala Leu Asn Asp 
He Tyr Gin Leu III Asp Thr Asn Leu Gin Lys Tyr Arg Arg Thr His 
Ala Gin Gly His Ala Ser Thr Ala Val Gly Ala Met Ser Tyr Ala Lys 
Oly His III Ser Asn Ala Phe Gly Thr Arg Ala Thr Ala Glu Gly Thr 
Ty r III Leu Ala Val Gly Leu Thr Ala Thr Ala Lys Ala Ala Ser Ser 
III Ala val Gly Ser Asn Ala Gin Ala He Gly Phe Ala Ala Thr Ala 

q A c z bU 

val Gly Gly Ser Thr Gin Val Asn Leu Asn Arg Gly He Ala Leu Gly 
Phe Gly Ser III Val Leu Gin Lys Asp Asn Asp Val Asn Ala Ala Asn 
val Arg Vll Tyr Ala Pro Asp Asp Asn Gin Pro lie Asp Asn Arg Tyr 
Lys lla Thr Phe Lys Asn Gly Ala Thr Asp val Phe Ser He Gly Asn 
III Asn Gly Asn Asp Ser Xle Arg Arg Lys He Xle Asn Val Gly Ala 
Oly ser Ala Asp Tnr Asp Ala Val Asn Val Ala Gin Leu Lys Glu Ala 
val Arg Leu Ala Asn Arg Gin He Thr Phe Lys Gly Asp Asp Ser Asn 
Asn Arg Val Glu Lys Gly Leu Gly Lys Thr Leu Thr Xle Thr Gly Gly 
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Al, Gin Thr S er Al, Leu Thr Asp Hi. A.. IX. Oly v.l v.l Gin Asn 
Gly Asp Gly Leu Ly. SE Gin Leu M. ™ Leu Thr Ser Leu Lys 
Met V.l Thr Thr ill Ash Leu Thr Al, Ash al« Ly. V,l Thr v.1 OXy 
Lys Thr Arg Leu Thr Thr Asp Lys 11= Gly Phe Thr Ash Asp Met A.„ 
G1 y He Asp Glu Ser Lys Pro Tyr Leu Asp Lys Asp Thr Gly lie His 
Al, ill Gly Gin Lys He £ Ly. Leu Thr Al, Gly v,l V.l Asp Asp 
IS Al. »a Thr Tyr Gly Gl„ Leu Lys Ly. V,l As„ Gl» Thr Al, Glu 
ser Al, Leu Gin Thr Phe Thr v,l Lys Ly. V,l Asp Lys A.„ Gly As„ 
- Asp Al, AS„ MS ser Lys He He Thr v.l Gly Lys Ash Aon Ly. Pro 
Asp Gly Sr Gin v,l A.„ Thr III Lys Leu Ly. Gly Glu Asn Gly v,l 
Asp III Thr Thr Glu Thr Asn Gly Thr v,l Thr Phe Gly Leu Asn Gin 
III Asn Gly Leu Thr III Gly Asn Ser Thr Leu Asn Asn Asp Gly Leu 
ser V,l Lys Asn Tnr Asn ser A.n Ly. Oln He Gin v.l Gly Al, Asp 
Gly He Thr III Thr Asp He Ser Asn Ser Lys Pro Gly Al, Gly He 
Glu Ash III Thr Arg He Thr Z Asp Gly He Gly Phe Al, Asn A.n 
Thr III ser Leu Asp Al, Asn Lys Pro Arg Leu Thr Pro Thr Gly II. 
Ill Al, Gly Gly Ly. HI Leu Thr Asn v.l Gin ser Al, He Asn Pro 
Al, Thr Asn Gly "ly Gin Leu Asp Phe Met Asn Arg Leu .» Thr Al, 
A.n Thr Glu Lys ser Gly Ser Al, Al, Thr He Ly. A.p Leu Tyr A.n 
Leu ser "in V.l Pro Leu Thr Phe Ala Gly Asp Thr Gly Pro A.n V.l 
Thr III Lys Leu Gly Glu III Leu Ly. v.l Lys Gly Gly Lys Thr Thr 
Zl ASP Asp Leu Thr III A.» Asn He Gly v,l V,! Al. Asp ser Thr 
Asp Ash ser Leu III V,l Lys Leu Al, Lys Thr Leu Ser Asp Leu Asp 
Al, V,l Asn Thr Lys Thr Leu Thr Ala Ser Asp Lys V.l Thr v.1 A.p 
ser Gly Zl A.n Thr Al. Lys III Glh Asn Gly Asp Leu Thr Phe Ser 
Ly. Ill A.h Thr Gly Al, Thr Pro Al, Thr Asn Ser Ly. Thr He Tyr 

7 90 

Gly v.1 Asp Gly Leu Lys Phe Thr A.p Asn Ash Gly He Al, «. Asp 
Gly Thr Thr Tyr He Thr Ly. Asp Lys v,l Gly Phe Al, Lys Glh Asp 
Gly ser Leu Asp Ly. Ser Lys Pro Tyr Leu A.p Ly. Asp Lys Leu Lys 
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Val Gly Glu Val Glu He Thr Thr Asn Gly lie Asn Ala Gly Oly Lys 

855 860 
Ala Til Thr Gly Leu Ser Asn Thr Leu Thr Asp Ala Thr Asn Ala Thr 



Thr Gly His Val Thr Gin Leu Gly He Val Asp Ser Thr Asp Lys Thr 



870 875 
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Arg Ala Ala Ser He Gly Asp Val Leu Asn Ala Gly Phe Asn Leu Lys 

3 905 910 



Asn Asn Gly Asp Ala Lys Asp Phe Val Ser Thr Tyr Asp Thr Val Asp 

915 920 

Phe lie Asn Gly Asn Ala Thr Thr Ala Lys Val Thr Tyr Asp Gly Lys 

qoA 935 940 

Ala Ser Lys Val Ala Tyr Asp Val Asn Val Asp Gly Thr Thr He His 



Thr Leu Thr Lys Thr Asp Ala Lys Gly Asp Lys Ala He Asn Phe Ser 



930 935 940 

Lys Val Ala Tyr Asp Val Asn Val Asp Gly tm 

945 950 955 v, 

Ill Thr Gly Ala Asp Gly Asn Lys Asn Gin He Gly Val Lys Thr Thr 
965 970 
Lvs Thr Asp Ala Lys Gly Asp Lys Ala He 
- 9 * 0 - - - - 985 " " 990 

Val Asn Ser Gly Asp Asp Lys Ala Leu He Asn Ala Lys Asp He Ala 

995 1000 10 u 

Asp Asn Leu Asn Thr Leu Ala Gly Glu He Arg Asn Thr Lys Gly Thr 

1010 1015 1020 

Ala Asp Thr Ala Leu Gin Thr Phe Gin Val^Lys Lys Val Lys Glu_Asn 

g Gly Asp Asp ^AsnTsp Ala Asp Thr H^Thr Val Gly Lys^sp Ala 

f? Lys Thr Asn Gin"" Asn Thr Leu Lys Leu Lys Gly Lys Asn Gly Leu 
¥*> 1060 i° 65 10 

Asp lie Gin Thr Asn Lys Asp Gly Thr Val Thr Phe Gly He Asn Thr 

IW 1075 1080 

H Gin Ser Gly Leu Lys Ala Gly Asn Asn Thr Thr Leu Asn Asn Asn Gly 

f Leaser He Lys Asn ThrTla Gly Asn Glu^GlnTle Gin Val Gly^Ala 

| asp Gly Val ^^Tla Lys Val Asn^snTly Val Val Gly^la Gly 

fi He Asp Gly Thr^hr Arg He Thr Arg Asp Glu He Gly Phe Ala Gly 
W 1140 1X45 X1 

Q Thr Asn Gly Ser Leu Asp Lys Ser Lys Pro His Leu Ser Lys Asp Gly 

M= 1155 1160 H65 

He Asn Ala Gly Gly Lys Lys He Thr Asn He Gin Ser Gly Glu He 



1175 H80 



Ala Gin Asn Ser Asn Asp Ala Val Thr Gly Gly Lys He Tyr Asp Leu 
185 ll 90 1195 



Lys Thr Glu Leu Glu Asn Lys He Server Thr Ala Lys Thr^Ala Gin 
Asn Ser Leu -Hie"" Phe Ser Val^Ala^sp Glu Gin Gly^Asn Asn Phe 

Thr Val Ser^sn Pro Tyr Ser Se^Tyr Asp Thr Ser Lys Thr Ser Asp 

1235 I 240 12 

Val He Thr Phe Ala Gly Glu Asn Gly He Thr Thr Lys Val Asn Lys 

Gly"" Val Arg Val Oly"" Asp Gin Thr^LysTly Leu Thr Thr^Pro 

Zl Leu Thr Val Gly"sn Asn Asn Gly LysTly He Val He Asp Ser 

1285 1290 

Gin Asn Gly Gin Asn Thr He Thr Gly Leu Ser Asn Thr Leu Ala Asn 
1300 1305 1310 
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val Thr Asn Asp Lys Gly Serial Arg Thr Thr Glu^Gln Gly Lys He 

He Lys^sp Glu Asp Lys ThrTrg Ala Ala Ser lie Val Asp Val Leu 

* n 1335 1340 

SerTla Gly Phe Asn Leu Gin Gly Asn Gly^Glu Ala Val Asp Phejal 

Ser Thr Tyr Asp Thr"S Asn Phe Ala^Asp Gly Asn Ala Thrjhr Ala 

Lys val Thr TyrTsp Asp Thr Ser Lys^hr Ser Lys Val Val Tyr Asp 

1380 1385 
val Asn Val Asp Asp Thr Thr_He Glu Val Lys Asp^Lys Lys Leu Gly 

Val Lys'Thr Thr Thr Leu Thrier Thr Gly Thr Gly Ala Asn Lys Phe 

1415 142 0 

Ala Leu Ser Asn Gin Ala Thr Gly Asp Ala^Leu Val Lys Ala Ser^sp 

He val Ala His LeuTsn Thr Leu Ser Gly Asp He Gin Thr Ala Lys 
• - - - - - - - 1445 - - - - - 1450 x ^ - - - - 

Gly Ala Ser Gin Ala Asn Ser Ser Ala Gly Tyr Val Asp Ala Asp Gly 

1460 1465 
Asn Lys val He Tyr Asp Ser Thr Asp Asn Lys Tyr Tyr Gin Ala Lys 

147*5 1480 1485 

Asn Asp Gly Thr Val A-p Ly. Thr Lys Glu Val^la Lys Asp Lys Leu 

5 Val^a Gin Ala Gin Thr'pro Asp Gly Thr^Leu Ala Gin Met Asnjal 

| Lys Ser Val He Asn^ys Glu Gin Val^Asp Ala Asn Lys^ys Gin 

U Gly He Asn GluTsp Asn Ala Phejal^ys Gly Leu Glu^Lys Ala Ala 

S ser Asp Asn'Lys Thr Lys Asn AlaTla Val Thr Val Gly Asp Leu Asn 

1560 Idod 
f Ala ValTla Gin Thr Pro Leu Thr Phe Ala Gly^p Thr Gly Thr Thr 

Ala"y°s Lys Leu Gly Glu^hr Leu Thr He Lys Gly Gly Gin Thr Asp 

1590 lbyz> 
Thr Asn Lys Leu Thr Asp Asn Asn He Gly Val Val Ala Gly Thr Asp 
1605 1610 



Gly Phe Thr Val Lys Leu Ala Lys Asp Leu Thr Asn Leu Asn Ser Val 
y 1625 1630 

5 Asp Asp Lys Gly Val Ser 
1640 1645 



1620 1625 1630 

AS n Ala Gly Gly Thr Lys He Asp Asp Lys Gly Val Ser Phe Val Asp 



ser Ser Gly. Gin Ala Lys Ala Asn Thr Pro Val^eu Ser Ala Asn Gly 

LeuTsp Leu Gly Gly LysTal He 'Ser Asn Val Gly Lys Gly Thr Lys 

1670 lo / 3 

Asp Thr Asp Alalia Asn Val Gin Gln^Leu Asn Glu Val Arg^n Leu 

Leu Gly Leu Gly^sn Ala Gly Asn^Asp Asn Ala Asp Gly_Asn Gin Val 

A sn He AlaTsp He Lys Lys Asp'pro Asn Ser Gly Ser Ser Ser Asn 

171 S 1720 1725 

Arg Thr val He Lys Ala Gly Thr Val Leu Gly^ly Lys Gly Asn Asn 

Asp^r Glu Lys Leu Ala^hr Gly Gly Val^ln val Gly Val Asp^ 

As P Gly Asn Ala AsnTly Asp Leu Ser Asn Val Trp Val Lys Thr Gin 
1765 1770 
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Lys Asp Gly ser Lys Lys Ala Leu^Leu Ala Thr Tyr Asn^Ala Ala Gly 
Gin Thr Asn'Jyr Leu Thr Asn Asn'pro Ala Glu Ala lie Asp Arg He 

T?qc: 1800 180b 

Asn Glu Gin Gly He Arg Phe Phe His Val Asn Asp Gly Asn Gin Glu 

1815 lozU 
Pro"* val Gin Gly Arg Asn Gly He Aspjer Ser Ala Ser Qly^yj 

Ser val Ala lie"" Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala 
1845 1850 
Val Ala lie Gly Arg Gin Thr Gin Ala Gly Asn Gin Ser lie Ala He 

i flfin 1865 
Gly Asp Asn Ala Gin Ala Thr^ly Asp Gin Ser Ile^Ala He Gly Thr 

Gly Asn'vll Val Ala Gly LysTs Ser Gly Ala He Gly As P Pro Ser 

1895 iyuu 
Thr'va 0 ! Lys Ala Asp Asn Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe 

1910 191b 
Thr Asp Ala Thr Gin Thr Asp Val Phe Gly Val Gly Asn Asn lie Thr 

1925 1930 
Val Thr Glu Ser Asn Ser Val Ala Leu Gly Ser Asn Ser Ala He Ser 

1 94 0 194 5 

Ala Gly Thr His Ala Gly Thr^Gln Ala Lys Lys Ser^Asp Gly Thr Ala 
Gly Thr'Thr Thr Thr Ala GlyTla Thr Gly Thr Val Lys Gly Phe Ala 

1975 1980 

GlyTln Thr Ala Val Gly Ala Val Ser Vainly Ala Ser Gly Ala^lu 
IS Arg He Gin Asn 1 "? Ala Ala Gly^lu Val Ser Ala Thr^er Thr 
Asp Ala Val AsnTly Ser Gin Leu^yr Lys Ala Thr Gln^ly He Ala 



2020 

Asn Ala Thr Asn Glu Leu Aap^ij Arg He His Gln^sn Glu Asn Lys 
Ala AsnTla Gly He Ser SerTla Met Ala Met^la Ser Met Pro Gin 
Ala'Jyr He Pro Gly Arg"" Met Val Thr^ly Gly He Ala Thr^is 
A sn Gly Gin Gly Ala"" Ala Val Gly^eu Ser Lys Leu se^Asp Asn 

Gly Gin Trp Valine Lys He Asn Gly Ser Ala Asp Thr Gin Gly His 

2100 2105 

Val Gly Ala Ala Val Gly Ala Gly Phe His Phe 
2115 2120 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Thr Val Leu Gly Gly Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gly He Gly He Ser Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala 

1 5 10 15 

Arg Gly Asp Lys Ser He Ala He Gly Asp He Ala Gin Ala Leu Gly 

20 25 30 

Ser Gin Ser He Ala He Gly Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 

15 10 15 

He Ala He Gly Asp He Ala Gin 
20 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Gly Ala Arg Gly Cys Asn Gly Ala Tyr Gly Gly Asn Gly Gly Asn Ala 

1 5 10 15 

Ala Arg 
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(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Tyr Thr Gly Asn Gly Cys Asp Ala Thr Arg Thr Cys Asn Cys Cys Asp 

1 5 « 

Ala Thr 

" ~ (2 )" INFORMATION "FOR SEQ-IB NO : 16: - ■ ■ - - - - - - - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAAGCGGACG GGGGGAAA 

(2) INFORMATION FOR SEQ ID NO : 17 : 



|y is 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



O (D) TOPOLOGY : linear 

yi (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTGCGCAATG TCACCAAT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 18 
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GGGGGAAAGG CGGAGCCAAT GCGCGCGGTG ATAAATCCAT TGCTATTGGT 



GACATTGCGC AA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys 



He Ala He Gly Asp He Ala Gin 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCATCATTGG AAAACGTTCT TCGGGGCGAA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

( c ) STRANDEDNE S S : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CGGTCAGCTT AGGCGTGGTT 
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